# Vietnamese OCR
Vintern 3B R Beta
MIT
Vintern-3B-R-beta is a multimodal large language model focused on complex reasoning tasks based on images, capable of decomposing reasoning steps and effectively controlling hallucination phenomena.
Image-to-Text
Transformers Supports Multiple Languages

V
5CD-AI
1,841
14
Erax VL 7B V2.0 Preview I1 GGUF
Apache-2.0
This is the result of applying weight/importance matrix quantization to the EraX-VL-7B-V2.0-Preview model, offering multiple quantization versions to suit different needs
Image-to-Text Supports Multiple Languages
E
mradermacher
246
1
Vintern 1B V3 5
MIT
Vintern-1B-v3.5 is a multimodal large language model fine-tuned based on InternVL2.5-1B, specializing in Vietnamese text processing, excelling in OCR and understanding Vietnamese-specific documents.
Image-to-Text
Transformers Supports Multiple Languages

V
5CD-AI
6,875
35
Erax VL 7B V2.0 Preview
Apache-2.0
EraX-VL-7B-V2.0-Preview is a powerful multimodal model designed for OCR and visual question answering, excelling in processing multiple languages including Vietnamese, with outstanding performance in recognizing medical forms, invoices, and other documents.
Image-to-Text
Transformers Supports Multiple Languages

E
erax-ai
476
22
Erax VL 2B V1.5 I1 GGUF
Apache-2.0
EraX-VL-2B-V1.5 is a multimodal foundation model supporting Vietnamese, English, and Chinese, with capabilities for image-to-text and image-text-to-text conversion.
Image-to-Text Supports Multiple Languages
E
mradermacher
467
0
Pretrained Trocr Small Vietnamese Nom
A model focused on Vietnamese speech recognition, supporting high-accuracy speech-to-text conversion.
Machine Translation
Transformers Other

P
nxquang-al
19
2
Featured Recommended AI Models